Video speed detection system

ABSTRACT

Multiple-object speed tracking apparatuses are disclosed, including a camera configured to capture a set of images of a monitored area (e.g., a roadway). The camera&#39;s longitudinal axis may be positioned at any viewing angle relative to a longitudinal axis of a roadway such that at least two moving objects moving on the roadway are included in a set of high or low resolution images. A computer system is configured to analyze the set of images to detect the two moving objects and substantially simultaneously determine a calculated rate of speed of at least one of the two moving objects. The computer system also provides an on-site speed calibration process for transforming locations of an image among the set of images into real-world coordinates by considering both perspective and scale of the image. An apparatus mount for at least one of either the camera or the computer system is also disclosed.

RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 13/178,471, filed Jul. 7, 2011, entitled “VIDEO SPEED DETECTION SYSTEM”; which is a continuation of U.S. patent application Ser. No. 11/970,371, filed Jan. 7, 2008, entitled “VIDEO SPEED DETECTION SYSTEM”; which claims priority under 35U.S.C. §119(e) to U.S. Provisional Patent Application No. 60/883,641, filed Jan. 5, 2007 entitled “NTS ViDAR™ Speed System”; each of which is hereby incorporated by reference herein in its entirety.

TECHNICAL FIELD

This disclosure relates generally to measurement of vehicle speeds and, more particularly, to a system and method for measuring vehicle speeds using video sensing.

BACKGROUND INFORMATION

The traditional approach to speed limit enforcement involves police officers observing the traffic and then chasing, stopping, and issuing citations to drivers who speed. The more modern approach to speed enforcement involves the use of a speed measuring device for measuring vehicle speeds and triggering a high speed camera to take photographs of speeding vehicles. The recorded photographs are then used to identify the registered vehicle owners or actual drivers of offending vehicles, to whom citations would be issued. Regardless of the approach adopted, the ability to provide accurate vehicle speed measurements is crucial for any speed law enforcement device. A number of methods exist for measuring vehicle speeds.

VASCAR, which stands for “visual average speed computer and recorder,” is an average speed calculator that measures and displays vehicle speed based on the time a target vehicle takes to travel a known distance. Due to its semi-manual operation nature, the accuracy of VASCAR is subject to human errors and is often challenged in court.

Inductive loop sensors are commonly used to detect the presence/absence of a vehicle based on the inductance changes caused by movements of a vehicle in the vicinity of the loop sensors. There are several systems that attempt to measure the speed of a vehicle by analyzing the waveforms of the output signal as the detected vehicle passes the sensor. One simple method is to use two inductive loops and calculate vehicle speeds by measuring the distance between the two loops and the difference in the time of detection of the vehicle at the two loops. For inductive loops to work, the sensors have to be placed beneath the road surface. Therefore the requirement to close lanes of traffic for installation, and cost of installation, can make them undesirable under certain circumstances.

RADAR, which stands for “radio detection and ranging”, is the technology that uses electromagnetic radio waves to detect moving vehicles and measure their speeds. A radar gun can be used to aim a beam of radio waves of a specific frequency at a vehicle and receive the returned signal as it is reflected off the target vehicle. The relative speed of the source of the emitted electromagnetic signal and the target vehicle causes the frequency of the returned signal to shift, which is known as the Doppler Effect. The magnitude of the Doppler shift is proportional to the relative speed between the source and the target vehicle. Therefore, the speed of a target vehicle can be inferred by comparing the frequency of the reflected signal to that of the emitted signal. Although radar is accurate in determining the speeds of relatively isolated vehicles, it is less effective at differentiating various vehicles in a cluttered environment, due to the relatively large beam size. In addition, radio signals can be easily detected by radar detectors. Therefore, drivers can use radar detectors to detect the presence of radar speed measuring devices and slow down temporarily to avoid being ticketed.

An alternative technology, LiDAR, uses pulsed laser light instead of radio waves to measure vehicle speeds. A LiDAR device can aim a pulsed narrow beam of light at a target vehicle and measure the time it takes to receive the reflected signal. Knowing the speed of light, the range of the target vehicle can be calculated. If the laser pulse is applied continuously and the target vehicle is moving, the range will change with time and the difference in time it takes the transmitted signal to strike the target and return can be used to measure the speed of the target vehicle. The laser beam is much narrower compared to the beam width of radar systems. Therefore, LiDAR can pinpoint to a particular target and is also more difficult to detect.

It is crucial in speed law enforcement that vehicle speeds obtained by speed measuring devices are accurate and credible. It is typically recommended that speed measuring devices undergo certain rigorous testing and certification procedures to ensure their accuracy and compliance to defined specifications. In addition, personnel that operate speed measuring devices are usually required to receive proper training on the correct operation of the devices. However, even with all the precautious procedures to ensure proper functioning of speed measuring devices in place, measurement errors are unavoidable due to the intrinsic limitations associated with each speed measuring method.

SUMMARY OF THE DISCLOSURE

As discussed above, in modern automated speed law enforcement, a speed measuring device is often supplemented with enforcement cameras that take photographs of speeding vehicles. The device measures vehicle speeds and triggers the enforcement camera(s) to take photographs of vehicles traveling above a predetermined speed limit However, while the recorded photographs can be used to identify the registered owner or even the driver of the offending vehicle, they cannot be used to reproduce the speed of the vehicle based on which the citation is issued. Therefore, it is of immense advantage to have a system that is not only capable of accurately measuring vehicle speeds in real time but also is capable of reproducing the vehicle speeds from recorded video evidence.

Accordingly, aspects and embodiments are directed to a system that can both accurately measure vehicle speeds in real time and reproduce the vehicle speeds either manually or automatically from the recorded video evidence. According to one embodiment, the system comprises a tracking camera and a processing unit. The tracking camera is capable of generating accurately time-stamped images of the monitored road and streaming them to the processing unit at a high frame rate. The processing unit processes each frame it receives from the tracking camera to detect and identify moving vehicles, and continues to track the detected vehicles through subsequent frames until the vehicles exit the tracking camera's field of view. According to one embodiment, the system allows an operator to mark four reference points on the road surface to define a parallelogram zone. It then establishes the projective mapping between image coordinates and real-world coordinates from the four point correspondences. While the system tracks detected vehicles as they pass through the monitored road segment, it keeps updating the vehicles' position information by projecting their image coordinates back into real-world coordinates, and extracts time information from the date and time stamp associated with each image frame. The system then calculates a vehicle's speed by dividing the distance the vehicle travels by the time it spends to cover the corresponding distance. As a vehicle exits the monitored zone, the system compares its exit speed to the trigger speed set by the operator and triggers the tracking camera to record a video clip showing the vehicle entering and leaving the camera's field of view. In addition, the system can trigger supplemental high-resolution cameras to take photographs of the speeding vehicle so that the registration or driver information of the vehicle can be obtained. With the recorded time-stamped video clip, the speed of a vehicle can be reproduced using the same algorithm offline, or a speed estimate can be generated manually from the time-stamped video using the basic time-over-distance method.

According to one embodiment, a system for monitoring vehicles or other objects, may comprise a tracking camera constructed and arranged to provide a plurality of time stamped images of a measurement area, and a processing unit, coupled to the tracking camera, that is configured to receive the time stamped images and to process the images to determine from the time-stamped images a speed of a vehicle passing through the measurement area. In one embodiment, the system may further comprise a high resolution camera coupled to the processing unit that is configured to capture an image of the vehicle in response to a signal from the processing unit. In one example, the processing unit is configured to print date, time, speed, and a reticule targeted at the vehicle to the image from the high resolution camera. In another example, the processing unit is configured to receive a trigger speed value from a user of the system. The processing unit may be configured to compare the speed of the vehicle to the trigger speed and to send a trigger signal to the high resolution camera when the vehicle speed exceeds the trigger speed. In another example, the system may further comprise a storage device, and the processing unit may be further configured, in combination with the storage device, to record the time-stamped images of a vehicle whose speed exceeds the trigger speed as this vehicle travels though the measurement area. The system may be further constructed and arranged to record time-stamped images of the vehicle passing through the measurement area, and the processing unit may be further configured to receive the recorded time-stamped images of the vehicle and determine the speed of the vehicle from the recorded time-stamped images.

In another example, the processing unit may be further configured to provide a graphical user interface (GUI) to the user of the system. The GUI may be configured to receive the actual dimension of the measurement area from a user of the system. The GUI may be also configured to provide live video of the measurement area from the tracking camera. In one example, the processing unit may be is configured to continuously update the speed of a vehicle and to display the speed of the vehicle at the position of the vehicle in the video. In another example, the GUI may be further configured to display the recorded time stamped images of a vehicle. The processing unit may be further configured to allow the user of the system to step through the recorded images and determine the frames at which a vehicle passes two reference lines separated with a known distance. In addition, the processing unit may be further configured to determine the speed of a vehicle from the distance between the two reference lines and the difference in the time stamps of the frames at which the vehicles pass the two reference lines.

In another example, the processing unit may be further configured to construct rectified images of the measurement area from the plurality of time-stamped images. The rectified images of the measurement area may be constructed by determining a projective mapping from four point correspondences. In one example, the processing unit may be further configured to map coordinates of at least one of the plurality of time-stamped images to actual coordinates of the measurement area using projective mapping. In another example, the processing unit may be further configured to extract a background image of the measurement area from the plurality of time-stamped images. In one example, the background image may be extracted by temporal median filtering of the plurality of time-stamped images. The processing unit may be further configured to detect and identify vehicles within the measurement area. In one example, the processing unit may be configured to detect the vehicles by segmenting the rectified image and determining large deviations in the rectified image.

In another example, the tracking camera may be configured to provide the images at a frequency of at least 15 frames per second. In another example, the processing unit may further comprise an optical character recognition (OCR) component configured to determine the time stamp from each image of the plurality of time-stamped images. Furthermore, the processing unit may be configured to determine a distance the vehicle has traveled between two time-stamped images. In one example, the processing unit may be configured to continually determine the speed of the vehicle between two time-stamped images as vehicle travels through the measurement area. In another example, the processing unit may be configured to monitor multiple vehicles in the measurement area and measure their speeds simultaneously. In another example, the processing unit may be further configured to collect traffic statistics of the measurement area including any of number of vehicles passed through and average vehicle speed.

According to another embodiment, a method of traffic monitoring may comprise acts of collecting a plurality of time-stamped images of a monitored area; analyzing the plurality of time-stamped images to detect an object moving through the monitored area; and determining a speed of the object. In one example, the monitored area may include a section of a road and the object may be a vehicle.

According to another embodiment, a method of monitoring vehicles or other objects may comprise acts of obtaining a plurality of time-stamped images of a measurement area, and processing the plurality of time-stamped images to determine from the time-stamped images a speed of a vehicle passing through the measurement area. The method may further comprise capturing a high resolution image of the vehicle. In one example, the method may further comprise printing date, time, speed, and a reticule targeted at the vehicle to the high resolution image. In another example, the method may further comprise receiving an input trigger speed value. In addition, the method may comprise comparing the speed of the vehicle to the trigger speed and sending a trigger signal to a high resolution camera to obtain the high resolution image when the vehicle speed exceeds the trigger speed. In another example, the method may further comprise storing the plurality of time-stamped images of a vehicle whose speed exceeds the trigger speed as the vehicle travels though the measurement area.

In one example, the method may further comprise receiving recorded time-stamped images of the vehicle; and determining the speed of the vehicle from the recorded time stamped images. In another example, the method may further comprise receiving the actual dimension of the measurement area from a user via a graphical user interface (GUI). In another example, the method may further comprise displaying live video of the measurement area via a graphical user interface (GUI). The method may further comprise continuously updating the speed of a vehicle and displaying the speed of the vehicle at the position of the vehicle in the video. In another example, the method may further comprise displaying recorded time-stamped images of the vehicle via a graphical user interface (GUI). The method may further comprise allowing a user to step through the recorded time-stamped images and determine the frames at which the vehicle passes two reference lines separated by a known distance. In addition, the method may comprise determining the speed of the vehicle from the distance between the two reference lines and the difference in the time stamps of the frames at which the vehicle passes the two reference lines.

In another example, the method may further comprise constructing rectified images of the measurement area from the plurality of time-stamped images. In one example, constructing the rectified images may include determining a projective mapping from four point correspondences. In another example, the method may further comprise mapping coordinates of at least one of the plurality of time-stamped images to actual coordinates of the measurement area using projective mapping. In another example, the method may further comprise extracting a background image of the measurement area from the plurality of time-stamped images. In one example, extracting the background image includes extracting the background image by temporal median filtering of the plurality of time-stamped images. Furthermore, the method may detect and identify vehicles within the measurement area. In one example, detecting the vehicles includes segmenting at least one of the rectified images and determining large deviations in the at least one rectified image. In another example, the method may further comprise determining the time stamp from each image of the plurality of time-stamped images using an optical character recognition (OCR) component. The method may also comprise determining a distance the vehicle has traveled between two time-stamped images. In another example, the method may further comprise continually determining the speed of the vehicle between two successive time-stamped images as vehicle travels through the measurement area. In yet another example, the method may comprise monitoring multiple vehicles in the measurement area and measuring their speeds simultaneously. In another example, the method may also comprise collecting traffic statistics of the measurement area including any of a number of vehicles passed through and average vehicle speed.

Additional aspects and advantages will be apparent from the following detailed description of preferred embodiments, which proceeds with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

Aspects and embodiments of this disclosure are described in detail below with reference to the accompanying drawings. It is to be appreciated that the drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in various figures is represented by a like numeral. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:

FIG. 1 is a schematic block diagram of one example of a system according to aspects of the present disclosure;

FIG. 2 is an example of a typical field of view from a tracking camera;

FIG. 3 is a flowchart of one example of a method for measuring vehicle speeds and photo enforcement, according to aspects of the present disclosure;

FIG. 4 is an illustration of a two-dimensional projective mapping between image and real-world coordinates, according to aspects of the present disclosure;

FIG. 5 is an illustration of a rectangle grid projected and overlaid to the image of the road, according to aspects of the present disclosure;

FIG. 6A is an example of an image captured by the tracking camera showing a vehicle on a section of road defined by reference markers, according to aspects of the present disclosure;

FIG. 6B is an example of a corresponding rectified image of the road surface defined by the reference markers;

FIG. 7A is an illustration of an example background image, according to aspects of the present disclosure;

FIG. 7B is an illustration of an object mask, corresponding to the image of FIG. 7A, segmenting the foreground and background pixels, according to aspects of the present disclosure;

FIG. 8A is an example of an image frame of a vehicle entering a monitored road segment;

FIG. 8B is an example of an image frame of the vehicle leaving the monitored road segment;

FIG. 8C is a rectified image corresponding to the image frame of FIG. 8A;

FIG. 8D is rectified image corresponding to the image frame of FIG. 8B;

FIG. 8E is an object mask corresponding to the rectified image of FIG. 8C;

FIG. 8F is an object mask corresponding to the rectified image of FIG. 8D;

FIG. 9 is an example of a synthesized composite image illustrating how the average speed of the vehicle between the two frames in FIGS. 8A and 8B can be calculated;

FIG. 10 is an example of a screen shot displayed on a graphical user interface, according to aspects of the present disclosure; and

FIG. 11 is another example of a screen shot that, in combination with FIG. 10, illustrates an example of a mechanism for manually verifying vehicle speeds using the time-over-distance method.

DETAILED DESCRIPTION OF EMBODIMENTS

This disclosure is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the drawings. The disclosure is capable of implementation in other embodiments and of being practiced or of being carried out in various ways. Examples of specific implementations are provided herein for illustrative purposes only and are not intended to be limiting. In particular, acts, elements and features discussed in connection with any one or more embodiments are not intended to be excluded from a similar role in any other embodiments. In addition, it is to be appreciated that the phraseology and terminology used herein is for the purpose of description and should not be regarded as limiting. The use herein of “including,” “comprising,” “having,” “containing,” “involving,” and variations thereof is meant to encompass the items listed thereafter and equivalents thereof as well as additional items.

FIG. 1 is a schematic block diagram illustrating one example of a typical setup of a system for speed measuring and enforcement, according to aspects of the present disclosure. In the illustrated embodiment, the system comprises a tracking camera 100 aimed at a segment of road 102, and a processing unit 106 coupled to the tracking camera 100. In the example illustrated in FIG. 1, the processing unit 106 is represented as a computer. However, it is to be appreciated that the processing unit may be any type of processor capable of running software and communicating with the tracking camera 100 and, optionally, an enforcement camera 112. The monitored segment of the road 102 is defined by four retro-reflective markers 104 placed along both sides of the road 102 so as to define an area on the road surface. In one example, the area is a rectangle, as illustrated. The following discussion will refer primarily to a rectangular monitored area; however, it is to be appreciated that the present disclosure is not so limited and the markers 104 may be used to define an area shaped other than as a rectangle. The width and length of the rectangle are measured and used as camera calibration parameters, as discussed further below. The tracking camera 100 has a field of view 108, and generates images with accurate time stamps at a predefined frame rate. According to one embodiment, a software engine running on the processing unit 106 retrieves images transmitted from the tracking camera 100. The software engine may allow the processing unit 106 to display the transmitted images live in a graphic user interface (GUI). The software engine may also analyze the images to detect moving vehicles 110 in field of view 108 of the tracking camera 100, and may calculate the speeds of the vehicles in real time.

The tracking camera 100 is not limited to any particular design or architecture. Any off-the-shelf camera that can deliver high quality images of the monitored road may be used. In a presently preferred embodiment, the camera should deliver the images at a minimum of 15 frames per second. To achieve high accuracy in speed measurement and legal credibility in court, in one embodiment it is recommended that the camera has the capability of generating highly accurate time stamps and inserting them into the images. Alternatively, other timing devices may be used to generate accurate timing information associated with each image frame. In another embodiment, the time at which the processing unit 106 receives each individual frame may be used as the time stamp. However, due to the unpredictable delay that may occur in the transmission process, the receiving time may exhibit greater variation than the time at which individual image frames are generated. Therefore, speed measurements obtained this way may be less accurate. Of course, the more accurate the time stamps, the more accurately the speed of the vehicle 110 can be calculated. Therefore, very accurate time stamps may be advantageous for at least some applications. Higher frame rates will also help improve the accuracy of the calculated speeds. Accordingly, in one example, the tracking camera 100 generates 30 images per second and each image is time-stamped to one one-hundredth of a second.

Referring again to FIG. 1, in one embodiment the system also includes an enforcement camera 112. The enforcement camera 112 maybe supplemental to the speed measuring components of the system, and may be used only for taking photographs of speeding vehicles so that registration or driver information can be obtained for enforcement purposes. According to one embodiment, characteristics of the enforcement camera 112 include that it can be triggered by signals from the processing unit 106 to take photos of the speeding vehicle 110, and that the image quality of the camera allows for identification of, for example, the registration number of the vehicle 110 or the driver's image, as may be required for enforcement purposes.

Referring to FIG. 2, there is illustrated an example of a typical tracking camera field of view when the system is used for vehicle speed measuring and enforcement in a mobile setup. In FIG. 2, the four cones 114 represent the four reference points (equivalent to markers 104 in FIG. 1) that define a rectangular region along the road 102. As discussed above, the width and length of the road segment defined by the cones 114 are measured and entered into the software engine for calibration of the tracking camera. The date and time stamp 116 can be found at the upper-left corner of the image in FIG. 2, and marks the time of the image frame to a specified resolution, for example one one-hundredth of a second. In one embodiment, for example, when the system is used for measuring vehicle speeds in a fixed setup, the tracking camera can be mounted on a pole or overhead master arm to achieve better resolution and field of view.

According to one embodiment, the system includes a software engine that runs on the processing unit 106 (see FIG. 1) to perform image processing, speed calculation, triggering, and video recording. As discussed above, the processing unit 106 is not limited to any particular design or architecture. The processing unit can be a laptop, a desktop or single-board computer (SBC) that is integrated with the rest of the system. In one embodiment, the processing unit includes basic peripheral devices such as memory, hard drive, Ethernet and/or other I/O communication ports to run the software engine and to interface with the cameras and the user. Depending on the interface, the cameras can be connected to the processing unit through, for example, an Ethernet router, a USB port or other ports. The whole system can be powered up by an external power adapter or by an internal battery. In one embodiment, a Kontron JRex-PM single board computer (SBC) is used for the processing unit 105. In one example, the SBC is equipped with an Intel Celeron 800 MHz processor and supplied with 512 MB RAM and 40 GB hard drive. However, the present disclosure is not limited to these examples and other processors that are capable of running the system software engine and interfacing with the cameras can be adapted for the same purpose.

According to one embodiment, the software engine retrieves images from the tracking camera, processes the images to detect and identify vehicles in the predefined zone, and calculates speeds of detected vehicles. Referring to FIG. 3, there is illustrated one example of a logical flow diagram for operation of the software engine. Aspects and embodiments of a vehicle speed detection and enforcement system according to the present disclosure are discussed below with continuing reference to FIG. 3.

As discussed above, in a first step 200, a user may enter the length and width of the monitored area (step 200). The dimensions of the monitored area are used to define reference points that enable the software engine to correlate points in the images captured by the tracking camera with points on the actual road surface, as discussed below.

By definition, the average speed of a vehicle can be calculated based on the time over-distance method:

$\begin{matrix} {{Speed} = \frac{D}{T}} & (1) \end{matrix}$

where D is the distance the vehicle travels and T is the time it takes to travel the distance. Therefore, in order to calculate the speed of a vehicle, one may measure how much distance the vehicle has traveled in a certain amount of time. With each image frame from the tracking camera being accurately time stamped, the time lapse T between two arbitrary frames can be easily calculated as

T=T ₂ −T ₁  (2)

where T₁ and T₂ are the respective time stamps associated with the two image frames. While it is relatively easy to measure T based on the time stamp in each image frame from the tracking camera, it is difficult to measure the distance that a vehicle has traveled during the time interval T. Any uncertainty in the measurements of time and distance will be reflected in the uncertainty in the speed estimate according to the following equation:

$\begin{matrix} {{\Delta \; {Speed}} = \sqrt{\left( \frac{\Delta \; D}{T} \right)^{2} + {\left( \frac{D}{T} \right)^{2}\left( \frac{\Delta \; T}{T} \right)^{2}}}} & (3) \end{matrix}$

For example, if the time is accurate up to 1/100 of a second, i.e., ΔT=0.01 sec, for a vehicle that travels at 40 MPH along 50 feet of distance, the uncertainty in the vehicle position measurements needs to be less than 0.9 feet in order for the error in speed to stay below +/−1 MPH.

To measure the distance D that a vehicle has traveled within time T from video, the system needs to know the displacement of a reference point of the target vehicle during the time interval, assuming the reference point remains motionless relative to the target vehicle. In other words, given the pixel coordinate (u, v) of a reference point, the system needs to know its real-world coordinate (x, y) relative to some predefined origin. Under the assumption of a pinhole camera, the transformation between image and real-world coordinates is a projective mapping. The general form of a two-dimensional projective mapping is a rational linear mapping:

$\begin{matrix} {{x = \frac{{au} + {bv} + c}{{gu} + {hv} + i}},{y = \frac{{du} + {ev} + f}{{gu} + {hv} + i}}} & (4) \end{matrix}$

Although there are nine coefficients in the above mapping, it can be assumed that i=1 without loss of generality for practical purposes. Hence, there are only eight degrees of freedom in the two-dimensional projective mapping, and these eight coefficients can be determined from the image coordinates and real-world coordinates of the four vertices of a quadrilateral. Let us denote the image and real-world coordinates of the four vertices of a quadrilateral as (u_(k), v_(k)) and (x_(k), y_(k)), respectively, for k=0, 1, 2, 3. Assuming i=1, the eight coefficients can be determined by solving the following set of linear equations:

$\begin{matrix} {{\begin{pmatrix} u_{0} & v_{0} & 1 & 0 & 0 & 0 & {{- u_{0}}x_{0}} & {{- v_{0}}x_{0}} \\ u_{1} & v_{1} & 1 & 0 & 0 & 0 & {{- u_{1}}x_{1}} & {{- v_{1}}x_{1}} \\ u_{2} & v_{2} & 1 & 0 & 0 & 0 & {{- u_{2}}x_{2}} & {{- v_{2}}x_{2}} \\ u_{3} & v_{3} & 1 & 0 & 0 & 0 & {{- u_{3}}x_{3}} & {{- v_{3}}x_{3}} \\ 0 & 0 & 0 & u_{0} & v_{0} & 1 & {{- u_{0}}y_{0}} & {{- v_{0}}y_{0}} \\ 0 & 0 & 0 & u_{1} & v_{1} & 1 & {{- u_{1}}y_{1}} & {{- v_{1}}y_{1}} \\ 0 & 0 & 0 & u_{2} & v_{2} & 1 & {{- u_{2}}y_{2}} & {{- v_{2}}y_{2}} \\ 0 & 0 & 0 & u_{3} & v_{3} & 1 & {{- u_{3}}y_{3}} & {{- v_{3}}y_{3}} \end{pmatrix}\begin{pmatrix} a \\ b \\ c \\ d \\ e \\ f \\ g \\ h \end{pmatrix}} = \begin{pmatrix} x_{0} \\ x_{1} \\ x_{2} \\ x_{3} \\ y_{0} \\ y_{1} \\ y_{2} \\ y_{3} \end{pmatrix}} & (5) \end{matrix}$

The inverse mapping from real-world coordinates (x, y) to image coordinates (u, v) is also of a rational linear form and the coefficients can be similarly determined. If more than four point correspondences can be established, the two linear equations from each point correspondence can be stacked up, and the resulting set of linear equations will be over-determined. In this case, the coefficients can be determined by solving the over determined linear system using the singular-value-decomposition (SVD) method. In the most general case, according to Equation 5, the image to real-world mapping, and its inverse, can be established from four point correspondences, i.e., from the pixel and real-world coordinates of four points in general position.

In an outdoor traffic law enforcement scenario, however, much of difficulty lies in measuring the real-world coordinates of the reference points. In general, it is much easier to measure the distance between points than to measure their absolute x and y coordinates relative to some predefined origin. To circumvent the difficulty and to take advantage of the fact that traffic lanes are usually drawn in parallel, in one embodiment, four points that define a parallelogram on the road surface are used instead of four arbitrary points that define a general quadrilateral. Therefore, as discussed above, the system can operate with only two pairs of reference markers placed along each side of the road (or lane) with the same distance. In one currently preferred embodiment, the two pairs of markers are arranged to define a rectangle on the road surface, i.e., the two sides of the parallelogram across the road are perpendicular to the direction of travel. This arrangement not only greatly simplifies the physical measurement procedure, but also simplifies the determination of the projective mappings. However, it is to be appreciated that a strict rectangle is not required for validity of the following derivation, and the present disclosure is not limited to the use of a rectangle.

According to one embodiment, by making use of vanishing points, the projective mappings can be derived from pure geometrical constructions. Referring to FIG. 4, there is illustrated a two-dimensional projective mapping between the image and real-world coordinates. In the real-world space, as shown in the right plot, the four points define a parallelogram, i.e. 01= 23 and 02= 13. The left plot shows the projected points in the image space. As can be seen from the figure, the four points generally do not form a parallelogram. Let us define point A to be the intersect point of lines 01 and 23, and point B to be the intersect point of lines 02 and 13. In the real-world space, two lines are drawn parallel to the two sides 02 and 23 of the parallelogram, these lines intersecting at points M and N respectively. The two corresponding projected lines are shown in the left image space. The projective mappings can be easily derived from the following equations for points on the lines 01 and 02 respectively:

$\begin{matrix} {\frac{u_{0} - u}{u - u_{A}} = {\left( \frac{y_{P} - y_{0}}{y_{1} - y_{0}} \right)\left( \frac{u_{0} - u_{1}}{u_{1} - u_{A}} \right)}} & (6) \\ {\frac{v_{0} - v}{v - v_{A}} = {\left( \frac{y_{P} - y_{0}}{y_{1} - y_{0}} \right)\left( \frac{v_{0} - v_{1}}{v_{1} - v_{A}} \right)}} & (7) \\ {\frac{u_{0} - u}{u - u_{B}} = {\left( \frac{x_{P} - x_{0}}{x_{2} - x_{0}} \right)\left( \frac{u_{0} - u_{2}}{u_{2} - u_{B}} \right)}} & (8) \\ {\frac{u_{0} - u}{u - u_{B}} = {\left( \frac{x_{P} - x_{0}}{x_{2} - x_{0}} \right)\left( \frac{u_{0} - u_{2}}{u_{2} - u_{B}} \right)}} & (9) \end{matrix}$

Projective mappings for points on the lines 23 and 13 can be similarly derived. For general points that do not fall on the sides of the quadrilaterals, their coordinates can be inferred by intersecting lines that connect points on the sides of the quadrilaterals, using computational techniques known to those skilled in the art. FIG. 5 illustrates a projective mapping by projecting a rectangular grid 118 on the road 102 to the image space, and overlaying it onto the road surface.

Referring again to FIG. 3, in step 204 the tracking camera is calibrated. In the camera calibration step 204, the system establishes a one-to-one correspondence between each pixel and its real-world position on the road surface. Therefore, for each image the software engine retrieves from the tracking camera, it is possible to reconstruct the parallelogram region defined by the four reference markers 104 on the road 102 (see FIG. 1). In one embodiment, a rectification process (step 206) samples the parallelogram at a specified scale along both directions in the real-world space. In one example, this sampling scale is 0.5 feet. After the image coordinates are determined from the projective mapping, pixels in the image at the projected coordinates are sampled, for example, using a Gaussian filter, to represent the corresponding points on the road. Referring to FIG. 6A, there is illustrated a random frame from the tracking camera showing a vehicle 110 in the tracking zone. FIG. 6B illustrates a rectified image from the random sample image of FIG. 6A, showing the projection of the vehicle onto the road surface. According to one embodiment, the rectified images contain only pixels that correspond to the marked road surface in the original images. Each pixel in the rectified images may also bear geometrical information. For example, each pixel may represent a 0.5 feet by 0.5 feet patch on the road plane. Therefore, distance between two arbitrary pixels in the rectified images can be easily calculated.

To be able to calculate vehicle speeds, the system needs first to detect and identify moving vehicles from the images retrieved from the tracking camera. In one embodiment, detection of moving vehicles is achieved by performing segmentation on the rectified images. The segmentation algorithm generates and maintains a background image (step 208) of the road that represents the road surface without any moving object. At each frame, it then compares the currently retrieved image to the background image and identifies pixels with large deviations as foreground pixels. The resulting binary images from the segmentation algorithm, which consist of foreground and background pixels, are called object masks. Typically, foreground pixels caused by moving objects form connected regions in the object mask images. This information can be used to detect moving vehicles (step 210). Depending on whether vehicles are approaching or receding from the camera, the front or rear of foreground regions can be chosen as reference points for the underlying moving vehicles and tracked during subsequent frames for speed measurement until they exit the rectified images.

There have been many methods and techniques developed for background extraction and motion detection. These methods range from the simplest frame differencing method, to median filtering, and to the more involved Mixture of Gaussian estimation method. These methods are well explored in the computer vision field. They differ in computational complexity and performance under challenging environments and could all be used in the video speed detection system according to aspects of the present disclosure for segmenting motion pixels from the background. In one example, a recursive approximation of the temporal median filtering algorithm is used for background extraction (step 208) due to its low computational complexity and robustness provided by the non-linear filtering. The algorithm can be described as follows:

$\begin{matrix} {{{Background}\mspace{14mu} {Extraction}\mspace{14mu} {Algorithm}\mspace{11mu} \text{:}}{{Initialization}\mspace{11mu} \text{:}}\; \text{}{{for}\mspace{14mu} {each}\mspace{14mu} {pixel}\mspace{14mu} x\; \text{:}}{{M_{0}(x)} = {I_{0}(x)}}{{\Delta_{0}(x)} = 0}{{V_{0}(x)} = 0}{{For}\mspace{14mu} {each}\mspace{14mu} {frame}\mspace{14mu} t\text{:}}{{for}\mspace{14mu} {each}\mspace{14mu} {pixel}\mspace{14mu} x\text{:}}{{{{if}\mspace{20mu} {M_{t - 1}(x)}} < {I_{t}(x)}},{{M_{1}(x)} = {{M_{t - 1}(x)} + 1}}}{{{{if}\mspace{20mu} {M_{t - 1}(x)}} > {I_{t}(x)}},{{M_{1}(x)} = {{M_{t - 1}(x)} - 1}}}{{\Delta_{t}(x)} = {{{{M_{t}(x)} - {I_{t}(x)}}}ä}}{{{for}\mspace{14mu} \Delta \; {t(x)}} \neq 0}{{{{if}\mspace{20mu} {\Delta_{t - 1}(x)}} < {N \times {\Delta_{t}(x)}}},{{V_{t}(x)} = {{V_{t - 1}(x)} + 1}}}{{{{if}\mspace{20mu} {\Delta_{t - 1}(x)}} > {N \times {\Delta_{t}(x)}}},{{V_{t}(x)} = {{V_{t - 1}(x)} - 1}}}{{{{if}\mspace{20mu} {\Delta_{t - 1}(x)}} < {V_{t}(x)}},{{D_{t}(x)} = 0}}{{{{if}\mspace{20mu} {\Delta_{t - 1}(x)}} > {V_{t}(x)}},{{D_{t}(x)} = 1}}} & (10) \end{matrix}$

Where I_(t)(x) is the rectified image at frame t, M_(t)(x) is the estimated rectified background image, Δ_(t)(x) is the absolute difference between I_(t)(x) and M_(t)(x), V_(t)(x) is the estimated variance, and D_(t)(x) is the object mask. At each frame t and pixel x, the difference Δ_(t)(x) is compared to the variance V_(t)(x) to determine whether or not the pixel belongs to a foreground object. If the difference Δ_(t)(x) is significant compared to the variance V_(t)(x), the pixel is classified as foreground, i.e., D_(t)(x)=1. Otherwise, the pixel is classified as background. FIG. 7A illustrates an example of the background image, M_(t)(x), and FIG. 7B illustrates an example of the corresponding object mask, D_(t)(x), with white pixels representing the foreground and black pixels representing the background. The input image corresponds to the frame shown in FIG. 6A. Notice that the vehicle is missing from the background image (FIG. 7A) but is correctly detected in the object mask (FIG. 7B).

After the image segmentation step, pixels that correspond to foreground objects are identified. According to one embodiment, the system then further analyzes the image strip for each individual lane to detect possible moving vehicles and identify their positions. On each row of the rectified object mask, it counts the maximum number of consecutive foreground pixels to see if the number exceeds a certain threshold value. A row is marked as a foreground row, meaning it is occupied by moving objects, if the maximum number of consecutive foreground pixels exceeds the threshold. Otherwise it is designated as a background row. This process is repeated separately for each lane. At the end of this process, each lane will be converted into a one-dimensional object mask with each item corresponding to a row in the original two-dimensional rectified image. Next, a one-dimensional median filtering can be applied to each one-dimensional array to eliminate isolated foreground and background pixels.

The system can detect presence of moving vehicles in each lane from the corresponding 1D array (step 210). This may be done by finding chunks of elements in the array that correspond to foreground objects. In one embodiment, after the chunks are isolated and identified, their lengths are computed and compared to a threshold so that segments with extremely small lengths are removed because they are likely result from noise rather than from genuine moving objects.

Referring again to FIG. 7A, it is easy to see that a large portion of the left lane is classified as foreground pixels due to projection of the vehicle from the right lane. If left uncorrected, this could lead to false object detection in the left lane. The problem can be resolved by checking the shape around the front or rear end of the identified foreground object. The object mask caused by a genuine moving vehicle will have a flat front or rear end, such as shown in the right lane of the image in FIG. 7B. On the other hand, there is no flat end in the left lane due to the nature of projection.

With moving vehicles detected at each frame, their corresponding speeds can be easily calculated (step 212) by dividing the distance they have traveled since the first time they are detected by the corresponding time that has elapsed. For example, if a vehicle is first detected by the system at frame t₀ at position (x₀, y₀), and at frame t it is detected at (x_(t), y_(t)), the average speed of the vehicle between t₀ and t can be calculated as:

$\begin{matrix} {{V(t)} = \frac{\sqrt{\left( {x_{1} - x_{0}} \right)^{2} + \left( {y_{1} - y_{0}} \right)^{2}}}{t - t_{0}}} & (11) \end{matrix}$

FIGS. 8A-8F illustrate an example of how the average speed of a vehicle 110 can be calculated according to one embodiment. FIGS. 8A and 8B illustrate two image frames showing a vehicle 110 entering and leaving the monitored road segment marked by the four traffic cones 114. FIG. 8A illustrates the frame at which the vehicle 110 enters the monitored rectangular region in full. FIG. 8B illustrates a subsequent frame showing the vehicle 110 approaching the exit of the monitored region. In this example, each frame is accurately time stamped to one one-hundredths of a second. The date and time stamp 116 appears at the upper-left corner of each image frame. In one embodiment, a template-based OCR (optical character recognition) routine is applied to each image to read out its time stamp.

FIGS. 8C and 8D illustrate the rectified images of the marked road segment corresponding to the frames shown in FIGS. 8A and 8B, respectively. FIGS. 8E and 8F illustrate the corresponding two-dimensional object masks (for FIGS. 8C and 8D, respectively) with white pixels indicating foreground and black pixels being background. In one example, the reference point, based on which the vehicle's speed is calculated, is chosen to be the bottom of the foreground segments (marked as line segments 120) of the one-dimensional object masks constructed from the two-dimensional object mask images. The real-world coordinates of the reference point can be directly read from its pixel coordinates in the respective object mask images. In the illustrated example, the respective coordinates at the two frames (FIGS. 8A and 8B) are (16, 3) and (16, 29) feet measured relative to the origin. The reference point is mapped back to the original input images and is marked as dots 122 in the two images in FIGS. 8A and 8B.

Referring to FIG. 9, there is illustrated a synthetic image demonstrating how the average speed of the vehicle may be calculated (step 212). In the illustrated example, the time lapse T between the two frames can be easily calculated from the time stamps:

T=25:00.22−24:59.80=0.42 second  (12)

The distance D that the vehicle 110 has traveled between the two frames can be calculated from the real-world coordinates of the reference point at the two frames:

D=√{square root over ((16−16)²+(20−3)²)}{square root over ((16−16)²+(20−3)²)}=26 feet  (13)

Therefore, according to equation 11, the average speed V of the vehicle 110 between these two frames can be determined:

$\begin{matrix} {V = {\frac{26\mspace{14mu} {feet}}{0.42\mspace{14mu} {second}} = {42.2\mspace{14mu} {MPH}}}} & (14) \end{matrix}$

According to one embodiment, the speed of a vehicle is updated at each frame from when the vehicle is first detected by the system until it exits the monitored zone. At that point, the exit speed of the vehicle may be calculated and compared to the preset trigger speed (step 214) to determine if the vehicle is speeding and, for example, if a citation needs to be issued. In case that the vehicle is speeding and video evidence needs to be collected, the system may calculate the appropriate length of the recording to ensure that the video clip will record the speeding vehicle (step 216) from when the vehicle enters the monitored zone until the vehicle exits the monitored zone. If supplemental enforcement cameras are connected, the system may also trigger the enforcement cameras to take photographs of the vehicle so that registration and/or driver information can be obtained (step 218).

According to one embodiment, to properly associate detected moving vehicles across subsequent image frames, an object matching procedure can be performed to compare image profiles of moving objects across frames. If the profiles are similar enough, it is safe to conclude that they correspond to the same moving vehicle. Otherwise, no association will be established. In another example, to further increase the accuracy of the speed measurements, robust estimation techniques, such as RANSAC, can be applied to the speeds obtained according to Equation 11.

The above description details how the system may detect moving vehicles and calculate their speeds at roadside. As discussed above, if the average speed of a vehicle exceeds the preset trigger speed at the time the vehicle exits the marked zone, a video clip may be recorded from the tracking camera showing the vehicle from entering to leaving the marked zone. Because the system according to embodiments of the present disclosure may measure a vehicle's speed based solely on video frames the system receives from the tracking camera, and all the video frames can be recorded in case of a violation, the system can use the same method to calculate the vehicle speed offline from the recorded video sequence. In this case, according to one embodiment, instead of using live image frames from the tracking camera, the software engine may use the image frames from the recorded video clip to detect vehicles and calculate their speeds. Because the recorded image frames are exactly the same frames that could be used to calculate the vehicle speeds at roadside, the system can reproduce the same speeds for the vehicle whether the calculation is done roadside or at a later time. Therefore, unlike any other speed measuring device, the system according to embodiments of the present disclosure may provide a method for obtaining verifiable vehicle speed information.

In addition, according to another embodiment the system provides a supplemental utility that allows one to manually calculate vehicle speeds from recorded video clips. In one example, this manually calculation is based on the same time-over-distance method used in VASCAR. FIGS. 10 and 11 are example screen shots that may be displayed on a user interface coupled to the system processing unit and illustrate how a vehicle's speed can be calculated using the method and software engine according to aspects of the present disclosure. In the two images, two virtual lines 124 across the road 102 define a pre measured distance along the road within which the speed of the target vehicle 110 may be calculated. When the front tire of the vehicle 110 hits the first line 124, the time t₁ can be read from the time stamp 116, which is located at the upper-left corner of the image in FIG. 10. When the front tire of the vehicle 110 hits the second line 124, the time t₂ can be read from the time stamp 116 of the image in FIG. 11. Therefore, the vehicle's speed can be simply calculated as:

$\begin{matrix} {V = \frac{L}{t_{2} - t_{1}}} & (15) \end{matrix}$

It is also to be appreciated that other reference points, such as the shadow of a vehicle 110, can also be used to measure the speed of the vehicle. In one example, if there is no frame at which the reference point aligns perfectly with one of the two lines 124, an estimate of the vehicle's speed can be obtained in favor of the driver by underestimating the distance or overestimating the time elapse. In addition, in another embodiment, a virtual grid overlay similar to that shown in FIG. 5 can be generated to allow for more accurate speed measurements.

Having thus described several aspects of at least one embodiment, it is to be appreciated that various alterations, modifications, and improvements will readily occur to those skilled in the art. Such alterations, modifications, and improvements are intended to be part of this disclosure, and are intended to be within the spirit and scope of the present disclosure. Accordingly, the foregoing description and drawings are by way of example only.

It will be obvious to those having skill in the art that many changes may be made to the details of the above-described embodiments without departing from the underlying principles of the invention. The scope of the present invention should, therefore, be determined only by the following claims. 

1. A multiple-object speed tracking apparatus comprising: a camera configured to capture a set of images of a monitored area, wherein a longitudinal axis of the camera may be positioned at any viewing angle in relation to a longitudinal axis of a roadway as long the viewing angle allows images of at least two moving objects moving simultaneously within the monitored area to be included in one of either a set of low resolution images or a set of high resolution images; a computer system configured to analyze the set of images to detect the at least two moving objects and substantially simultaneously determine a calculated rate of speed of at least one of the two moving objects, wherein the computer system provides an on-site speed calibration process configured to transform locations in an image from the set of images into real-world coordinates by considering, in a single step, both perspective and scale of the image; and an apparatus mount upon which at least one of either the camera or the computer system is positioned, wherein the apparatus mount is one of either a fixed mount or a mobile mount.
 2. The multiple-object speed tracking apparatus of claim 1: wherein the computer system is further configured to compare the calculated rate of speed to a predetermined speed value, wherein the computer system includes a homographic calibration process of determining the calculated rate of speed that includes a linear fit comparison of a set of elements related to a distance traveled to an observed time to travel the distance, and wherein the computer system includes a process to ensure better accuracy of the calculated rate of speed by not indicating an object has exceeded the predetermined speed value until determining that a value of a residue of the liner fit does not exceed a preset maximum value of the residue of the linear fit.
 3. The multiple-object speed tracking apparatus of claim 2 wherein a set of incident data is generated when the calculated rate of speed exceeds the predetermined speed value. 